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(54) Dynamic reconfiguration of network servers 

(57) One or more portions (22 1 ) of a database (220) 
which a primary server (106) uses to process client re- 
quests are duplicated (211) on one or more supplemen- 
tal servers (105). As the clients demand for service in- 
creases and the processing load on the primary server 
becomes excessive (400), the primary server automat- 
ically off-loads the processing of those portions of the 
client requests that require one or more of the duplicated 
portions onto the supplemental servers by substituting 
(402) a secondary page (253) or a secondary object in 



its database that points to the one or more duplicated 
portions in the supplemental servers for a correspond- 
ing primary page (252) or a prinriary object in its data- 
base that points to the one or more duplicated portions 
in its database. The supplemental servers then serve 
the portions of the client requests that require the one 
or more of the duplicated portions. As demand for serv- 
ice decreases and the primary server becomes under- 
loaded (404). It automatically restores (406) the primary 
page or the primary object in its database and resumes 
serving the entire client requests. 
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Description 

Technical Field 

This invention relates to information networic archi- 
tecture, such as the Internet or an intranet architecture. 

Background of the invention 

In an information network, such as the Internet, us- 
er's computers, referred to as clients, request informa- 
tion from information-providers' computers, referred to 
as servers, and the servers supply the requested infor- 
mation to the clients, in the World WkJe Web (WWW), 
which is a de-facto standard for storing, finding, and 
transferring information on the Intemet, the information 
is supplied in the form of pages. A page is a display 
screen-full of information isxpressed in textual, graphi- 
cal, scriptural, and/or other form. A page comprises one 
or more information objects. An object is an information 
element that has its own network address ~ preferably 
a unique single address- called a URL (Uniform Re- 
source Locator). For example, a page may comprise 
one or more text objects, one or more picture objects, 
and one or more script objects that are presented on the 
display screen in a layout defined by a frame object. 

Typically, a server has a main page that serves as 
the entry point to the information and services that the 
server provides. This page typically points to other pag- 
es and to objects (e.g., graphic images, video/audio/text 
files, etc.). whk^h are typically served by the same serv- 
er. 

Generally, when a client accesses the server, the 
server provides the main page to the client and then in- 
teracts with the client to provide the client with desired 
additional information and/or services. As increasing 
numbers of clients access the server, the server's 
processing load increases and its performance eventu- 
ally degrades, so that users experience increasing de- 
lays between the time at which they place a request to 
the server and the time at which their request is satisfied 
by the server. 

To avoid overloading of a server, typically an admin- 
istrator must manually reconfigure the server and redi- 
rect some of the requests to other servers in order to 
lessen the load on the subject server. Some service pro- 
viders store replicas of the served information in a plu- 
rality of servers and have different ones of the servers 
serve different requests, e.g., on a round-robin basis, 
thereby spreading the load of requests over multiple 
servers. This has several disadvantages. Firstly, an ad- 
ministrator's manual Intervention is stow, inefficient, 
prone to error, and often not prompt. Secondly, using a 
plurality of servers to serve requests on a round-robin 
basis results in underutilization of the servers during pe- 
riods when relatively few requests are being made, and 
hence it is inefficient. Furthermore, it requires all server 
information to be repik:ated on each server; the servers 



cannot take advantage of a common cache for common 
data. 

Summary of the Invention 

5 

This invention is directed to solving these and other 
problems and disadvantages of the prior art. Generally 
according to the invention, a portion of the Infomratfon 
which a primary sender uses to process client requests 
10 is replicated on one or more supplemental, stand-by, 
servers, and as the clients' demand for service increas- 
es and the processing load on the primary server be- 
comes excessive, the primary sen/er automatically off- 
loads the processing of those portions of the client re- 
'5 quests that require the replicated portion of the informa- 
tion onto the supplemental servers. As demand for serv- 
ice decreases and the primary server becomes under- 
loaded, preferably the primary server autonnatically 
resumes sen/ing the entire client requests. 

20 The advantages of the Invention include the foltow- 
ing: the load-shedding and load-sharing happen auto- 
nnatically. without human inten/ention, based on the 
present processing load. Only one server, or one group 
of servers, out of the entire server set serves an individ- 

25 ual portion of the information (e.g. . a page, or an object, 
or a group of pages or objects) at any one time, which 
allows for efficient caching of the information. And more 
uniform response times are provided to clients even as 
client demand for services varies greatly. Moreover, 

30 while a standby server is not serving the primary serv- 
er's clients, its processing power may be used for other 
processing activities, such as serving other clients 
whose demand for service peaks at a time different from 
the primary sender's clients, thereby resulting in efficient 

35 server utilization. 

According to a first aspect of the invention, a client- 
server system comprises a plurality of servers for 
processing client requests, wherein at least one first 
sen/er of the plurality of servers has first information and 

40 second informatton related to the first information, for 
processing portions of the client requests that require 
the first information and portions of the client requests 
that require the second information. The at least one first 
server processes both portions of the client requests 

45 while the processing load on the at least one server is 
not excessive, e.g., does not exceed a predetermined 
first limit. In response to the processing load on the at 
least one first server becoming excessive, the at least 
one first server processes the porttons of the client re- 

50 quests whk:h require the first informatton without also 
processing the portions of the client requests which re- 
quire the second information, and automatically redi- 
rects the portions of the client requests which require 
the second information to at least one second server for 

55 processing. The at least one server of the plurality of 
servers has the second information and processes the 
redirected portions of the client requests which require 
the second information, automatically in response to the 
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redirection. Preferably, the at least one first server au- 
tomaticaily ceases redirecting the portions of the client 
requests that require the second infornnation and 
resumes processing of both portions of the client re- 
quests in response to the processing load on the at least 
one first sen/er ceasing to be excessive, e.g., falling be- 
low a predetermined second limit. 

According to a second aspect of the invention, a 
method of operating a client-server system that includes 
a plurality of servers tor processing client requests com- 
prises the following steps. While a processing load on 
at least one first server of the plurality of servers is not 
excessive, the at least one first sender processes both 
portions of client requests that require first information 
and portions of the client requests that require second 
information related to the first information; the at least 
one first server has both the first information and the 
second information. In response to the processing load 
on the at least one first server becoming excessive, the 
at least one server processes the portions of the client 
requests that require the first information without also 
processing the portions of the client requests that re- 
quire the second information, and automatically redi- 
rects the portions of the client requests that require the 
second information to at least one second server of the 
plurality of servers. In response to the redirection, the 
at least one second server automatically processes the 
redirected portions of the client requests that require the 
second infomiation; the at least one second server has 
the second information. Preferably, when the process- 
ing load on the at least one server falls below a prede- 
termined limit, the at least one sender automatically 
ceases to redirect the portions of the client requests that 
require the second information and resumes processing 
both portions of the client requests. 

These and other advantages and features of the In- 
vention will become more apparent from the following 
description of an illustrative embodiment of the invention 
taken together with the drawing. 

Brief Description of the Drawing 

FIG. 1 is a block diagram of an information network 
that includes an illustrative embodiment of the in- 
vention; 

FIG. 2 is a block diagram of partial contents of mem- 
ories of servers of the information network of FIG. 
1; and 

FIGS. 3-5 each are a flow diagram of partial oper- 
ations of a different one of the servers of the infor- 
mation network of FIG. 1 . 

Detailed Description 

FIG. 1 shows an illustrative information network, 
which in this example is the Internet. It comprises a plu- 
rality of clients 100-101 and a plurality of servers 
105-107 interconnected by the Internet network fabric 



110. Each server 105 is a computer having a processor 
and a memory, wherein the processor executes control 
programs stored in the memory to provide services and 
data stored in the memory. Each server 105-107 is a 

s primary server for a database of information A-G, re- 
spectively. According to the invention, however, besides 
being a primary server for a database, each server 
105-107 is also a secondary, or supporting, server for a 
portion of one or more other servers' databases. In the 

10 example of FIG. 1 , server 105 is a supplemental server 
for a portion BZ of server's 106 database B; server 106 
is a supplemental server for a portion AY of sender's 1 05 
database A and a portion CW of server's 107 database 
C; and server 1 07 is a supplemental server for a portk>n 

IS AX of server's 1 05 database A. 

FIG. 2 shows the data contents of memories 
205-207 of servers 105-107, respectively, that are rele- 
vant to an understanding of this invention. Memory 205 
stores database A 210 and a duplicate 211 of portion 

20 BZ 22 1 of database B 220. Memory 206 stores database 
B 220, a duplicate 222 of portion CW 231 of database 
C 230. and a duplicate 223 of portion AY 213 of data- 
base A 21 0. And memory 207 stores database C 230 
and a duplicate 232 of portion AY 21 2 of database A 21 0. 

2S Database A 21 0 comprises a primary main page A1 
240 that includes links to other pages A2-A4 243-245 in 
database A 210. Pages A3 244 and A4 245 in turn in- 
clude links to yet other pages AS 246 and A6-A7 
247-248, respectively, in database A 210. Database A 

30 21 0 further includes a secondary main page A1' 241 that 
includes links to pages A2-A3 243-244 in database 210, 
but instead of including a link to page A4 245 of portbn 
AX 212 in memory 205, it includes^ a link to duplicate 
page A4 245 of duplicate portion AX 232 in memory 207. 

3S Database A 21 0 yet further includes a tertiary main page 
A1 ■ 242 that includes a link to page A2 243 in database 
210, but instead of including links to pages A3 244 and 
A4 245 of portions AX 212 and AY 21 3 in memory 205, 
it includes links to duplicate pages A3 244 and A4 245 

40 of duplk^te portions AY 223 and AX 232, respectively, 
in memories 206 and 207, respectively. 

Database B 220 comprises a main page B1 250 that 
includes links to an object B2 251 and to another page 
B3 252 in database B 220. B3 252 is a primary page 

4S that in turn includes links to other objects and/or pages 
84-85 258-259 in database B 220. Database B 220 fur- 
ther includes a secondary page B3' 253 which, instead 
of including links to pages B4-B5 258-259 in portion BZ 
221 of database B 220 in memory 206, includes links to 

so duplicate objects and/or pages B4-B5 258-259 of dupli- 
cate portbn BZ 211 in memory 205. 

Database C 230 comprises a primary format-object 
CI 260 of a main page that includes a link to a data ob- 
ject C2 262 in portion CW 231 of database C 230. Da- 

55 tabase C 230 further includes a secondary format -object 
CV 261 of the main page which, instead of including a 
iinkto data object C2 262 in portion CW 231 of database 
C 230 in memory 207, includes a link to duplicate object 
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C2 262 of duplicate portion CW 222 in memory 206. 

It is assumed that servers 1 05-1 07 experience their 
heaviest processing loads at different times. For exam- 
ple, sen/er 105 may be most heavily used in the eve- 
nings, sender 106 may be most heavily used on work- 
days, and sen/er 107 may be most heavily used on 
weekends. Initially, all sen/ers 1 05-1 07 operate conven- 
tionally. That is. server 105 initially sen/es all requests 
for information from database A 21 0. server 1 06 initially 
serves all requests for information from database B 220, 
and sewer 107 initially senses all requests for informa- 
tion from database C 230. in a conventional manner, us- 
ing primary pages and/or objects 240. 252, and 260. As 
is also conventional, each sen/er 105-107 keeps a 
record of its present processing load, for example, in the 
form of a number of accesses (requests) sewed per unit 
of time. 

Operations of sewers 105-107 that are relevant to 
an understanding of the invention are diagrammed in 
FIGS. 3-5. respectively In addition to its conventional 
operations, each sewer 105-107 executes a load-con- 
trol program that is initialized with predetermined load 
limits. As shown in FIG. 3, sewer 105 repeatedly checks 
if its present processing load exceeds a high-load upper 
limit "AV high" at step 300. If not. it means that sewer 
1 05 is not overloaded, and so sewer 1 05 remains at step 
300; if so, it means that sewer 105 is overloaded, and 
sewer 105 therefore substitutes secondary page AT 
241 for primary page A1 240. at step 302. This has the 
effect of causing all subsequent requests for information 
from portion AX 212 of database A 210 to be directed 
to sewer 107. Sewer 107 sewes these requests in a 
_ conventional manner from duplicate portion AX 232. 
' Hence, when sewer 105 becomes overloaded, some of 
its processing load is taken over by sewer 107. 

Following step 302, sewer 1 05 checks if a high-load 
lower limit "AV low" exceeds its present processing 
load, at step 304. If so. it means that sewer 105 is un- 
derloaded, and so sewer 105 substitutes primary page 
A1 240 for secondary page AV 241 , at step 306. This 
has the effect of resuming initial operation, where sewer 
105 is sewing all requests for information from database 
A 210. Sewer 105 then returns to step 300. 

If it is determined at step 304 that limit A1' tow does 
not exceed the present load, sewer 105 checks whether 
the present load again exceeds the A1 ' high limit, at step 
308. If not. it means that sewer 105 is not overloaded, 
and so sewer 105 returns to step 304; if so. it means 
that sewer 105 is again overloaded, and sewer 105 
therefore substitutes tertiary page Al" 242 for second- 
ary page AV 241, at step 310. This has the additional 
effect of causing all subsequent requests for information 
from portion AY 21 3 of database A to be directed to sew- 
er 106. Sewer 106 sewes these requests in a conven- 
tional manner from duplicate portion AY 223, thereby 
taking on some of the processing load that would other- 
wise have to be done by sewer 1 05. and hence reducing 
the load on sewer 105. 



Following step 310. sewer 105 checks if the AV low 
limit exceeds the present processing load of sewer 1 05, 
at step 312. If not. sewer 105 remains at step 312; if so, 
it means that sewer 105 is underloaded, and sewer 105 
5 therefore substitutes secondary page A1 * 241 for tertiary 
page AV 242, at step 314. This has the effect of sewer 
105 taking back the portion of the processing lead that 
had been transferred to sewer 106 at step 310. Sewer 
1 05 then returns to step 304. 
10 The operation of sewers 1 06 and 1 07 is similar. As 
shown in FIG. 4. sewer 106 repeatedly checks if its 
present processing toad exceeds a high-load upper limit 
•B3' high", at step 400, If not, sewer 106 is not overload- 
ed and remains at step 400; if so. sewer 106 is over- 
rs loaded, and therefore it substitutes secondary page 83' 
253 for primary page 83 252, at step 402. This has the 
effect of causing all subsequent requests for informatkxi 
from portion BZ 221 of database B 220 to be directed to 
sewer 105. Sewer 105 sewes these requests in a con- 
20 ventional manner from duplicate portion BZ211 . thereby 
relieving the load on sewer 1 06. 

Following step 402, sewer 1 06 checks if a high-load 
lower limit "BS* low" exceeds its present processing load 
at step 404. If not, server 106 remains at step 404; if so, 
2S it means that sewer 1 06 is underloaded, and therefore 
sewer 106 substitutes primary page B3 252 for second- 
ary page B3* 253, at step 406. This has the effect of re- 
suming initial operation, where sewer 106 is sewing all 
requests from database 8220. Sewer 106 then returns 
30 to step 400. 

As shown in FIG. 5. sewer 107 repeatedly checks 
if its present processing load exceeds a load limit "CV". 
at step 500. If not. sewer 107 is not overloaded and re- 
mains at step 500; if so. sewer 107 is overloaded, and 
35 therefore it substitutes secondary object C V 261 for pri- 
mary object CI 260, at step 502. thereby transferring 
some of its processing load to sewer 106. Following 
step 502, sewer 107 repeatedly checks if the load limit 
"CV" exceeds its present processing load, at step 504. 
40 If not, sewer 107 remains at step 504; if so, it means 
that sewer 107 is no longer overloaded, and therefore 
sewer 107 substitutes primary object CI 260 tor sec- 
ondary object CV 261 , at step 506. thereby resuming its 
initial operation. Sewer 107 then returns to step 500. 
45 Of course, various changes and modifications to the 
illustrative embodiment described above will be appar- 
ent to those skilled in the art. For example, instead of 
storing both primary and secondary pages or objects, 
the primary pages or objects can be converted into the 
so secondary pages or objects "on-the-fly" (e.g., in real 
time), and vice versa. Likewise, instead of duplicate por- 
tions of the database being pre-stored on supplemental 
sewers, the database portions may be duplicated and 
distributed to the supplemental sewers "on-the-fly." 
I^oreover, measurements and limits other than the 
number of accesses per unit of time can be used to de- 
termine whether to off-toad or retum processing from or 
to the primary sewer. These measurements and limits 
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can be forward-looking, such as predictive algorithms 
which estimate future load based on load experienced 
at a similar time in the past. Furthermore, the main serv- 
er can request present processing load data from the 
stand-by servers and incorporate these data into its de- 
cision of whether to off-load processing to those stand- 
by servers. 



Claims 

1. A client -server system (FIG. 1) comprising: 

a plurality of seirvers (105-107) for processing 
client (100-101) requests; 
at least one first server (105) of the plurality of 
servers having first information (210) and sec- 
ond information (212) related to the first infor- 
mation, for processing portions of the client re- 
quests that require the first information and por- 
tions of the client requests that require the sec- 
ond information, while a processing load on the 
at least one sen/er is not excessive, CHARAC- 
TERISED BY the at least one first server 
processing portions of the client requests which 
require the first information (210) without also 
processing the portions of the client requests 
which require the second information (21 2) and 
automatically redirecting the portions of the cli- 
ent requests which require the second informa- 
tion to at least one second server (107) for 
processing, in response to the processing load 
on the at least one first server becoming exces- 
sive; and 

the at least one second server (1 07) of the plu- 
rality of servers having the second information 
(232) of the first and the second information, for 
processing the redirected portions of the client 
requests which require the second information, 
automatically in response to the redirection. 

2. The system of claim 1 wherein: 

the at least one second server (107) has the 
second information (232) without also having the 
first information (21 0), for processing the redirected 4S 
portions of the client requests which require the sec- 
ond information without also processing the por- 
tions of the client requests which require the first 
information. 

so 

3. The system of claim 1 wherein: 

the first information includes a primary page 
or object (240) that points to the second information 
(212) of the at least one first sen/er (105) while the 
processing load on the at least one first server is 55 
not excessive, and includes a secondary page or 
object (241 ) that points to the second information 
(232) of the at least one second server (107) in re- 



sponse to the processing load on the at least one 
first server becoming excessive. 



first information and the portions of the client 
requests which require the third information 
without also processing the portions of the cli- 
ent requests which require the second informa- 
tion and automatically redirects (302) the por- 
tions of the client requests which require the 
second information to the at least one second 
server for processing in response to the 
processing load on the at least one first server 
exceeding (300) the predetermined first limit for 
a first time, and processes the portions of the 
client requests which require the first informa- 
tion without also processing the portions of the 
client requests which require the second infor- 
mation and the portions of the client requests 
which require the third information and auto- 
matically redirects (310) the portions of the cli- 
ent requests which require the third infomnation 



4. The system of claim 2 wherein: 
s the at least one first server (105) processes 

both portions of the client requests while the 
processing load on the at least one first server does 
not exceed (500) a predetermined limit, and proc- 
esses the portions of the client requests which re- 
10 quire the first information without also processing 
the portions of the client requests which require the 
second information and automatically redirects 
(502) the portions of the client requests which re- 
quire the second information in response to the 
IS processing load on the at least one first sender ex- 
ceeding (500) the predetermined limit. 

6. The system of claim 4 wherein: 

said predetermined limit is a first predeter- 
20 mined first limit, and the at least one first sen/er au- 
tomatically ceases (506) redirecting the portions of 
the client requests that require the second informa- 
tion and resumes processing of both portions of the 
client requests in response to the processing load 
2S on the at least one first server falling below (504) a 
predetermined second limit. 

6. The system of claim 1 wherein: 

30 the at least one first server (105) further has 

third information (213) related to the first infor- 
mation (210), processes the portions of the cli- 
ent requests that require the first information 
(21 0), the portions of the client requests that re- 
35 quire the second information (212), and por- 

tions of the client requests that require the third 
information (213) while the processing load on 
the at least one server does not exceed (300) 
a predetermined first limit, processes the por- 
40 tions of the client requests which require the 
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to at least one third server (108) of the pluralrty 
ot servers tor processing, in response to the 
processing load on the at least one first server 
exceeding (308) the predetermined first limit for 
a second consecutive time; s 
the at least one third server (106) of the plurality 
of servers has the third information (223) of the 
first, the second, and the third information, for 
processing the redirected portions of the client 
requests which require the third information up- io 
on the processing load on the at least one first 
server exceeding the predetermined first limit 
for the second consecutive time; and 
the at least one first sender (105) further auto- 
matically ceases redirecting (31 4) and resumes 
processing the portions of the client requests 
that require one of the second and the third in- 
formation in response to the processing load on 
the at least one first server falling below (312) 
a predetermined second limit for a first time, 20 
and automatically ceases redirection and 
resumes (306) processing of the portions of the 
client requests that require another of the sec- 
ond and the third information in response to the 
processing load on the at least one first server 2S 
falling below (304) the predetermined second 
limit for a second consecutive time. 

A method of operating a client-server system (FIG. 
1 ) that comp rises a p lurality of servers (1 05-1 07) for 30 
processing client (100-101) requests, comprising 
the step of: 

processing both portions of client requests that 
require first information (210) and portions of 3S 
^ the client requests that require second informa- 
tion (212) related to the first information in at 
least one first server (105) of the plurality of 
servers, which at least one first server has both 
the first information and the second informa- 4a 
tion, while a processing load on the at least one 
first server is not excessive (300), CHARAC- 
TERISED BY the further steps of 
processing the portions of the client requests 
that require the first information (210) without 45 
also processing the portions of the client re- 
quests that require the second information 
(212) in the at least one first sen/er, in response 
to the processing load on the at least one first 
server becoming excessive (300); so 
autonDatically redirecting (302) the portions of 
the client requests that require the second in- 
formation to at least one second server (107) 
of the plurality of sen/ers, which at least one 
second server has the second information 55 
(232), in response to the processing load on the 
at least one first server becoming excessive 
(300); and 



automatically processing the redirected por- 
tions of the client requests that require the sec- 
ond information (232) in the at least one second 
server (107), in response to the redirection. 

8. The method of claim 7 further comprising the steps 
of: 

automatically ceasing (306) to redirect the por- 
tions of the client requests that require the sec- 
ond information in response to the processing 
load on the at least one first server failing bebw 
(304) a predetermined level; and 
automatically resuming processing of both said 
portions of the client requests in the at least one 
server, in response to the processing load on 
the at least one first server falling below (304) 
the predetermined level. 
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FIG. 1 
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(54) Dynamic reconfiguration of networic servers 

(57) One or more portions (221 ) of a database (220) 
which a primary server (106) uses to process client re- 
quests are duplicated (211 ) on one or more supplemen- 
tal servers (105). As the clients demand for service In- 
creases and the processing load on the primary server 
becomes excessive (400), the primary server automat- 
ically off-loads the processing of those portions of the 
client requests that require one or more of the duplicated 
portions onto the supplemental sen/ers by substituting 
(402) a secondary page (263) or a secondary object in 



its database that points to the one or more duplicated 
portions in the supplemental servers for a correspond- 
ing primary page (252) or a primary object in its data- 
base that points to the one or more duplicated portions 
in its database. The supplemental servers then serve 
the portions of the client requests that require the one 
or more of the duplicated portions. As demand for serv- 
ice decreases and the primary server becomes under- 
loaded (404), it automatically restores (406) the primary 
page or the primary object in its database and resumes 
serving the entire client requests. 
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